WE CLAIM: 



1. A method of performing a k-stage FFT (fast Fourier 
transform) computation of N data points wherein k and N are 
integers, the method comprising: 

performing a plurality of operations upon the N data 
points, each one of the plurality of operations comprising; 

performing an import of a respective plurality of 
sets of data points of the N data points from an external 
memory into an internal memory; 

performing a FFT computation of the k-stage FFT 
computation upon each one of the sets of data points in the 
internal memory; and 

performing an export of the respective plurality of 
sets of data points of the N data points from the internal 
memory into the external memory to update the respective sets 
of data points of the N data points in the external memory. 

2. A method of performing a k-stage FFT computation of N 
data points wherein k and N are integers, the method 
comprising : 

for each one of a plurality of stage groupings of at 
least one stage of k stages of the k-stage FFT computation 
wherein the plurality of stage groupings of at least one stage 
collectively comprise the k stages, performing a respective 
plurality of operations on the N data points, each one of the 
respective plurality of operations comprising: 

performing an import of a respective plurality of 
sets of data points of the N data points from an external 
memory into an internal memory; 



performing a FFT computation upon each one of the 
respective plurality of sets of data points of the N data 
points in the internal memory; and 

performing an export of the respective plurality of 
sets of data points of the N data points from the internal 
memory into the external memory to update the respective 
plurality of sets of data points of the N data points in the 
external memory . 

3. A method of performing a k-stage FFT computation 
according to claim 2 comprising performing an import of a next 
respective plurality of sets of data points of the N data 
points for a next one of the respective plurality of operations 
while the performing a FFT computation upon each one of the 
respective plurality of sets of data points of the N data 
points in the internal memory. 

4. A method of performing a k-stage FFT computation 
according to claim 2 comprising performing an export of a 
previous respective plurality of sets of data points of the N 
data points for a previous one of the respective plurality of 
operations while the performing a FFT computation upon each one 
of the respective plurality of sets of data points of the N 
data points in the internal memory. 

5. A method of performing a k-stage FFT computation 
according to claim 2 wherein the FFT computation is performed 
using decimation in frequency (DIF) . 

6. A method of performing a k-stage FFT computation 
according to claim 2 wherein the FFT computation is performed 
using decimation in time (DIT) . 



7. A method of performing a k-stage FFT computation 
according to claim 2 wherein N substantially satisfies N = 2 k . 

8. A method of performing a k-stage FFT computation 
according to claim 7 wherein the stage groupings of at least 
one stage comprise Q stage groupings each comprising S stages, 
said Q and said S being integers with Q>1, S>1 and k>QS. 

9. A method of performing a k-stage FFT computation 
according to claim 8 comprising: 

grouping, within a stage grouping, I, of the Q stage 
groupings wherein I is an integer satisfying 0<I<Q-1, the N 
data points into 2 IS blocks of data of N/2 IS respective data 
points; 

grouping, within each one of the 2 blocks of data, 
the N/2 IS respective data points into 2 s sub-blocks of data of 
N/2 IS+S respective data points; 

grouping, within each one of the 2 s sub-blocks of 
data, the N/2 IS+S respective data points into Mi sub-sub-blocks 
of data of N/(2 IS+S Mi) respective data points wherein Mi is an 
integer; 

taking, within each one of the 2 IS blocks of data, a 
respective sub-sub-block of data of the Mi sub-sub-blocks of 
data from each one of the 2 s sub-blocks of data to form a set o 
2 s sub-sub-blocks of data of N/(2 IS Mi) data points a total of Mi 
times to produce Mi sets of 2 s sub-sub-blocks of data of 
N/(2 IS Mi) data points; and 

taking, within each one of the M x sets of 2 s sub-sub- 
blocks of data, a respective data point from each one of the 2 s 



sub-blocks of data to form a set of 2 s data points a total of 
N/(2 IS+s Mi) times to produce N/(2 IS+s Mi) sets of 2 s data points. 

10. A method of performing a k-stage FFT computation 
according to claim 9 wherein, for the stage grouping, I, of the 
of the Q stage groupings of the plurality of stage groupings, 
the performing a respective plurality of operations on the N 
data points comprises performing Mi of the plurality of 
operations for each one of the 2 IS blocks of data within the 
stage grouping, I, each one of the Mi of the plurality 
operations comprising : 

performing an import of respective N/(2 IS+s Mi) sets of 
2 s data points from the external memory into the internal 
memory; 

performing an S-stage FFT computation of the k-stage 
FFT computation upon each set of 2 s data points of the 
respective N/(2 IS+s Mi) sets of 2 s data points in the internal 
memory; and 

performing an export of the respective N/(2 IS+s Mi) sets 
of 2 s data points from the internal memory into the external 
memory to update the respective N/(2 IS+s Mi) sets of 2 s data 
points in the external memory. 

11. A method of performing a k-stage FFT computation 
according to claim 10 wherein M x substantially satisfies Mi = 
N/(2 IS Mi) wherein M T corresponds to the number of data points the 
internal memory can hold. 

12. -A method of performing a k-stage FFT computation 
according to claim 10 wherein Mi substantially satisfies M x = 
N/(2 IS_1 Mi) wherein Mi corresponds to the number of data points a 
buffer in the internal memory can hold. 
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13. A method of performing a k-stage FFT computation 
according to claim 10 wherein the performing an S-stage FFT 
computation of the k-stage FFT computation upon each set of 2 s 
data points of the respective N/<2 IS+s Mi) sets of 2 s data points 

5 results in N/(2 IS+S Mi) S-stage FFT computations, the N/(2 IS+S M x ) 
S-stage FFT computations comprising performing one stage of S 
stages for each one of the N/(2 IS+S Mi) S-stage FFT computations 
and iterating through the S stages. 

14. A method of performing a k-stage FFT computation 
10 according to claim 10 wherein the performing an S-stage FFT 

computation of the k-stage FFT computation upon each set of 2 s 
5 data points of the respective N/(2 IS+s Mi) sets of 2 s data points 

F|J results in N/(2 IS+S M x ) S-stage FFT computations, the N/ (2 IS+S Mi) 

S-stage FFT computations comprising performing computations for 
15 S stages of an S-stage FFT computation of one of the N/(2 IS * S Mi) 
f S-stage FFT computations before another S-stage FFT computation 

hj is performed on another of one of the N/(2 IS+S Mi) S-stage FFT 

nJ 

~ m computations. 

M 15. A method of performing a k-stage FFT computation 

20 according to claim 8 wherein the stage groupings of at least 

one stage comprise a stage grouping comprising D stages wherein 
D is an integer and k=QS+D. 

16. A method of performing a k-stage FFT computation 
according to claim 15 wherein the performing an import 

25 comprises importing the plurality of sets of data points into a 
buffer in the internal memory, the buffer having a capacity to 
hold M T data points and D being an integer substantially 
satisfying D<log 2 (Mi) . 

17. A method of performing a k-stage FFT computation 
30 according to claim 15^wherein the performing an import 



78945-24 





rU 



38 



comprises importing the plurality of sets of data points into 
one of two buffers of a double buffer in the internal memory, 
each one of the two buffers having a capacity to hold M T /2 data 
points and D being an integer substantially satisfying 
5 D<log 2 (Mi/2) . 

18. A method of performing a k-stage FFT computation 

according to claim 15 comprising, for each one of a plurality 
of sub-stage groupings of at least one stage of the stage 
grouping of D stages wherein the plurality of sub-stage 
10 groupings of at least one stage collectively comprise the D 
M ; stages, performing a set of sub-stage FFT computations wherein 

for each one of the sub-stage FFT computations respective data 
points and respective twiddle factors are loaded into a high- 
yj speed cache. 

si 



15 19. A processing platform implementing the method of 

if claim 2. 

* 2 1 

OV 20. A processor comprising: 

^ an internal memory adapted to store data; 

a DMA (direct memory access) unit adapted to import 
20 data into the internal memory and to export data from the 
internal memory; and 

a central processor unit (CPU) adapted to perform, 
for each one of a plurality of stage groupings of at least one 
stage of k stages of the k-stage FFT computation wherein the 
25 plurality of stage groupings of at least one stage collectively 
comprise the k stages, a respective plurality of operations on 
the N data points, for each one of the respective plurality of 
operations the CPU is adapted to: 



provide instructions to the DMA unit for performing 
an import of a respective plurality of sets of data points, of 
N data points, into the internal memory; 

perform a FFT computation upon each one of the 
respective plurality of sets of data points, of the N data 
points, in the internal memory; and 

provide instructions to the DMA unit for performing 
an export of the respective plurality of sets of data points, 
of the N data points, from the internal memory to update the 
respective plurality of sets of data points of the N data 
points . 

21. A processor according to claim 20 further comprising 
an external memory adapted to store the N data points. 

22. A processor according to claim 21 comprising at least 
one bus wherein the DMA unit is further adapted to import the 
respective plurality of sets of data points, of the N data 
points, from the external memory and to export the respective 
plurality of sets of data points, of the N data points, into 
the external memory through the at least one bus. 

23. A processor according to claim 20 wherein the 
internal memory comprises: 

a double buffer comprising two buffers each adapted 
to stored data points; 

two buses each adapted to provide a channel for 
importing the data points into a respective one of the two 
buffers and for exporting the data points from the respective 
one of the two buffers; and 
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the CPU being further adapted to provide instructions 
to the DMA unit for importing a respective plurality of sets of 
data points, of the N data points, into one of the two buffers 
while the CPU performs FFT computations upon another respective 
plurality of sets of data points, of the N data points, stored 
in another one of the two buffers. 

24. A processor according to claim 20 wherein the 
internal memory comprises: 

a double buffer comprising two buffers each adapted 
to stored data points; 

two buses each adapted to provide a channel for 
importing the data points into a respective one of the two 
buffers and for exporting the data points from the respective 
one of the two buffers; and 

the CPU being further adapted to provide instructions 
to the DMA unit for exporting respective sets of data points 
from one of the two buffers while the CPU performs FFT 
computations upon other respective sets of data points stored 
in another one of the two buffers. 

25. A processor according to claim 20 wherein N 
substantially satisfies N = 2 k with k being an integer and 
wherein the stage groupings of at least one stage comprise Q 
stage groupings each comprising S stages, said Q and said S 
being integers with Q>1, S>1 and k>QS. 

26. A processor according to claim 20 wherein the CPU is 
adapted to: 

group, within a stage grouping, I, of the Q stage 
groupings wherein I is an integer satisfying 0<I<Q-1, the N 



data points into 2 IS blocks of data of N/2 IS respective data 
points; 

group, within each one of the 2 IS blocks of data, the 
N/2 IS respective data points into 2 s sub-blocks of data of N/2 IS+S 
respective data points; 

group, within each one of the 2 s sub-blocks of data, 
the N/2 IS+S respective data points into Mi sub-sub-blocks of data 
of N/(2 IS+S Mi) respective data points wherein Mi is an integer; 

take, within each one of the 2 IS blocks of data, a 
reS p ec tive sub-sub-block of data of the Mi sub-sub-blocks of 
data from each one of the 2 s sub-blocks of data to form a set of 
2 s sub-sub-blocks of data of N/(2 IS Mi) data points a total of Mi 
times to produce Mi sets of 2 s sub-sub-blocks of data of 
N/(2 IS Mi) data points; and 

take, within each one of the Mi sets of 2 s sub-sub- 
blocks of data, a respective data point from each one of the 2 
sub-blocks of data to form a set of 2 s data points a total of 
N/(2 IS+s Mi) times to produce N/(2 IS+s Mi) sets of 2 s data points. 

27. A processor according to claim 26 wherein, for the 

stage grouping, I, of the of the Q stage groupings of the 
plurality of stage groupings, the CPU is adapted to perform M x 
of the plurality operations for each one of the 2 IS blocks of 
data within the stage grouping, I, and for each one of the Mi of 
the plurality operations the CPU is adapted to: 

provide instructions to the DMA unit for performing 
an import of respective N/(2 IS+s Mi) sets of 2 s data points into 
the internal memory; 



performing an S-stage FFT computation of the k-stage 
FFT computation upon each set of 2 s data points of the 
respective N/(2 IS+s Mi) sets of 2 s data points in the internal 
memory; and 

provide instructions to the DMA unit for performing 
an export of the respective N/(2 IS+s Mi) sets of 2 s data points 
from the internal memory to update the respective N/ (2 IS+s Mi) 
sets of 2 s data points . 

28. A processor according to claim 27 wherein the 
internal memory comprises a buffer adapted to store the 
plurality of sets of data points, wherein the buffer has a 
capacity to hold M T data points and M x substantially satisfies 
Mi = N/(2 IS Mi) wherein M T in an integer. 

29. A processor according to claim 27 wherein the 
internal memory comprises a double buffer adapted to store the 
plurality of sets of data points, wherein the double buffer 
comprises two buffers each capable of storing Mi/2 data points 
and wherein said Mi substantially satisfies Mi = N/(2 IS ' 1 M I ), Mi 
being an integer. 

30. A processor according to claim 29 wherein while the 
CPU performs the S-stage FFT computation of the k-stage FFT 
computation upon each set of 2 s data points of the respective 
N/(2 IS+s Mi) sets of 2 s data points, the respective N/ (2 IS+s Mi) sets 
of 2 s data points being stored in one of the two buffers, the 
CPU is adapted to provide instructions to the DMA unit to 
import, into another one of the two buffers, respective 
N/(2 IS+s Mi) sets of 2 s data points of a next one of the each one 
of the Mi of the plurality operations. 

31. A processor according to claim 29 wherein while the 
CPU performs the S-stage FFT computation of the k-stage FFT 
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computation upon each set of 2 s data points of the respective 
N/(2 IS+s Mi) sets of 2 s data points, the respective N/ (2 IS+s Mi) sets 
of 2 s data points being stored in one of the two buffers, the 
CPU is adapted to provide instructions to the DMA unit to 
5 export, from another one of the two buffers, respective 

N/ (2 IS+s Mi) sets of 2 s data points from previous one of the each 
one of the Mi of the plurality operations. 

32. A processor according to claim 27 wherein the 

performing an S-stage FFT computation of the k-stage FFT 
10 computation upon each set of 2 s data points of the respective 
if N/(2 IS+s Mi) sets of 2 s data points results in N/ (2 IS+S Mi) S-stage 

FFT computations, the CPU being further adapted to evaluate the 
N/(2 IS+S Mi) S-stage FFT computations at one stage of S stages 
for each one of the N/(2 IS+S Mi) S-stage FFT computations and 
15 iterate through the S stages. 



33. A processor according to claim 27 wherein the 



U internal memory comprises a high-speed cache. 

m 

34. A processor according to claim 33 wherein the 
performing an S-stage FFT computation of the k-stage FFT 

20 computation upon each set of 2 s data points of the respective 
N/(2 IS+s Mi) sets of 2 s data points results in N/(2 IS+S Mi) S-stage 
FFT computations, the CPU being further adapted to perform 
computations for S stages of an S-stage FFT computation of one 
of the N/(2 IS+S Mi) S-stage FFT computations before performing 
25 another S-stage FFT computation of another one of the N/(2 
Mi) S-stage FFT computations. 

35. A processor according to claim 28 wherein the stage 
groupings of at least one stage comprise a stage grouping of D 
stage wherein D is an integer, said k substantially satisfying 

30 k=QS+D and said D substantially satisfying D<log 2 (Mi). 



36. A processor according to claim 29 wherein the stage 
groupings of at least one stage comprise a stage grouping of D 
stage wherein D is an integer, said k substantially satisfying 
k=QS+D and said D substantially satisfying D<log 2 (Mi/2 ) . 

37. A processor according to claim 20 wherein the CPU is 
adapted to produce the k-stage FFT computation in real-time. 

38. A processor according to claim 20 wherein the 
internal memory comprises a buffer adapted to store twiddle 
factors used in the FFT computations. 

39. An article of manufacture comprising: 

a computer usable medium having computer readable 
program code means embodied therein for performing a k-stage 
FFT computation of N data points wherein k and N are integers, 
the computer readable code means in said article of manufacture 
comprising : 

computer readable code FFT means for performing, for 
each one of a plurality of stage groupings of at least one 
stage of k stages of the k-stage FFT computation wherein the 
plurality of stage groupings of at least one stage collectively 
comprise the k stages, a respective plurality of operations on 
the N data points, for each one of the respective plurality of 
operations the computer readable code FFT means comprising: 

computer readable code means for providing 
instructions for performing an import of a respective plurality 
of sets of data points of the N data points from an external 
memory into an internal memory; 
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computer readable code means for performing a FFT 
computation upon each one of the respective plurality of sets 
of data points of the N data points in the internal memory; and 

computer readable code means for providing 
instructions for performing an export of the respective 
plurality of sets of data points of the N data points from the 
internal memory into the external memory to update the 
respective plurality of sets of data points of the N data 
points in the external memory. 

40. An article of manufacture according to claim 39 
wherein the stage groupings of at least one stage comprise a 
stage grouping of D stage and Q stage groupings of S stages 
wherein D, Q and S are integers and wherein said k 
substantially satisfies k=QS+D and k = log 2 (N). 

41. An article of manufacture according to claim 40 
comprising further computer readable code means for: 

grouping, within a stage grouping, I, of the Q stage 
groupings wherein I is an integer satisfying 0<I<Q-1, the N 
data points into 2 IS blocks of data of N/2 IS respective data 
points ; 

grouping, within each one of the 2 IS blocks of data, 
the N/2 IS respective data points into 2 s sub-blocks of data of 
N/2 IS+S respective data points; 

grouping, within each one of the 2 s sub-blocks of 
data, the N/2 IS+S respective data points into Mi sub-sub-blocks 
of data of N/(2 IS+S Mi) respective data points wherein M x is an 
integer; 



taking, within each one of the 2 IS blocks of data, a 
respective sub-sub-block of data of the Mi sub-sub-blocks of 
data from each one of the 2 s sub-blocks of data to form a set of 
2 s sub-sub-blocks of data of N/(2 IS Mi) data points a total of M x 
times to produce Mi sets of 2 s sub-sub-blocks of data of 
N/(2 IS Mi) data points; and 

taking, within each one of the Mi sets of 2 s sub-sub- 
blocks of data, a respective data point from each one of the 2 s 
sub-blocks of data to form a set of 2 s data points a total of 
N/(2 IS+s Mi) times to produce N/ (2 IS+s Mi) sets of 2 s data points. 

42. An article of manufacture according to claim 41 

wherein, for the stage grouping, I, of the of the Q stage 
groupings of the plurality of stage groupings, the computer 
readable code FFT means is further adapted to perform Mi of the 
plurality operations for each one of the 2 IS blocks of data 
within the stage grouping, I, and for each one of the Mi of the 
plurality operations the computer readable code FFT means 
further comprising : 

computer readable code means for providing 
instructions for performing an import of respective N/ (2 Mi) 
sets of 2 s data points from the external memory into the 
internal memory; 

computer readable code means for performing an S- 
stage FFT computation of the k-stage FFT computation upon each 
set of 2 s data points of the respective N/(2 IS+s Mi) sets of 2 s 
data points in the internal memory; and 



computer readable code means for providing 
instructions for performing an export of the respective 
N/(2 IS * s Mi) sets of 2 s data points from the internal memory : 
the external memory to update the respective N/ (2 IS+s Mi) set 
2 s data points in the external memory. 



